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Entwicklung, psychometrische Merkmale und Validitat in einer 
interkulturellen nicht-klinischen Stichprobe und in einer Stichprobe von 
Herzinsuffizienz-Risikopatienten 



Abstract 

Objective: General self-efficacy has been found to be an influential 
variable related to the adaptation to stress and chronic illness, with the 
General Self-Efficacy (GSE) Scale by Jerusalem and Schwarzer being a 
reliable and valid instrument to assess this disposition. The aim of this 
study was to construct and test a short form of this scale to allow for a 
more economical assessment of the construct. 
Methods: The item characteristics of the original scale were assessed 
using an intercultural non-clinical sample (n=19,719). Six items with 
the highest coefficient of variation and good discrimination along the 
range of the trait were selected to build a short form of the instrument 
(GSE-6). Subsequently, the psychometric properties and the concurrent 
and predictive validity of the GSE-6 were tested in a longitudinal design 
with three measurements using a sample of patients with risk factors 
for heart failure (n=l,460). 

Results: Cronbach's alpha for the GSE-6 was between .79 and .88. We 
found negative associations with symptoms of depression (-.35 and 
-.45), anxiety (-.35), and vital exhaustion (-.38) and positive associ- 
ations with social support (.30), and mental health (.36). In addition, 
the GSE-6 score was positively associated with active problem-focused 
coping (.26) and distraction/self-encouragement (.25) and negatively 
associated with depressive coping (-.34). The baseline GSE-6 score 
predicted mental health and physical health after 28 months, even after 
controlling for the respective baseline score. The relative stability over 
twelve and 28 months was r=.50 and r=.60, respectively, while the 
mean self-efficacy score did not change over time. 
Conclusions: The six item short form of the GSE scale is a reliable and 
valid instrument that is useful for the economical assessment of general 
self-efficacy in large multivariate studies and for screening purposes. 

Keywords: self efficacy, questionnaires, psychometrics, heart diseases 

Zusammenfassung 

Hintergrund: Die generalisierte Selbstwirksamkeit hatsich als einfluss- 
reiche Variable im Zusammenhang mit der Anpassung an stressreiche 
Situationen und chronische Erkrankungen gezeigt. Die Skala Generali- 
sierte Selbstwirksamkeit (GSW) von Jerusalem und Schwarzer ist ein 
reliables und valides Instrument zur Erhebung dieser Disposition. Ziel 
der vorliegenden Studie war die Entwicklung und Prufung einer Kurzform 
dieser Skala, urn eine okonomischere Erfassungdes Konstruktszu er- 
moglichen. 
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Methoden: Die Itemmerkmale der Original-Skala wurden anhand der 
Daten einer interkulturellen, nicht-klinischen Stichprobe (n=19.719) 
bestimmt. Sechs Items mit dem hochsten Variationskoeffizienten und 
guter Diskrimination uber den Merkmalsbereich wurden ausgewahlt, 
um aus ihnen eine Kurzform des Instruments zusammenzustellen (GSW- 
6). AnschlieSend wurden psychometrische Merkmale und die konkur- 
rente und pradiktive Validitat der GSW-6 in einem Langsschnittdesign 
mit drei Messzeitpunkten an einer Stichprobe von Patienten mit Risiko- 
faktoren fur eine Herzinsuffizienz gepruft (n=1.460). 
Ergebnisse: Cronbachs alpha fur die GSW-6 lagzwischen .79 und .88. 
Wir fanden negative Zusammenhange mit depressiven Symptomen 
(-.35 und -.45), Angstsymptomen (-.35), und vitalerErschopfung(-.38) 
sowie positive Zusammenhange mit sozialer Unterstutzung (.30) und 
psychischer Gesundheit (.36). Weiterhin hing der GSW-6-Score positiv 
mit aktivem problemorientiertem Coping (.26) und Ablenkung/Selbst- 
aufbau (.25) sowie negativ mit depressiver Krankheitsverarbeitung 
(-.34) zusammen. Der GSW-6-Ausgangswert konnte die psychische 
und kdrperliche Gesundheit nach 28 Monaten, auch nach Kontrolle 
des jeweiligen Ausgangswertes, vorhersagen. Die relative Stabilitat uber 
zwdlf bzw. 28 Monate betrug r=.50 und r=.60, wahrend sich der mittlere 
Selbstwirksamkeitsscore im Zeitverlauf nichtanderte. 
Schlussfolgerungen: Die aus sechs Items bestehende Kurzform der 
GSW-Skala ist ein reliables und valides Instrument, das zur okonomi- 
schen Erfassung der generalisierten Selbstwirksamkeit in groSen mul- 
tivariaten Studien und zum Einsatz als Screeninginstrument geeignet 
ist. 

Schliisselworter: Selbstwirksamkeit, Fragebogen, Psychometrie, 
Herzerkrankungen 



Introduction 

Over the past years there has been increasing evidence 
for the role of personal dispositions like optimism, self- 
esteem, and self-efficacy as protective factors in adapting 
to stress and chronic illness [1], [2], [3], [4], [5], [6], [7]. 
Self-efficacy has been found to be positively associated 
with mobility, activities of daily living, and quality of life 
in stroke patients [3], [4], with family and social function- 
ing in coronary heart disease patients [8], with self-care 
in youngadults with type I diabetes [9], with physical well- 
being in coronary heart disease patients [8], osteoarthritis 
patients [1], and spinal cord injury patients [7], as well 
as with psychological well-being in cancer patients [2], 
[10], stroke patients [3], [4], osteoarthritis patients [1], 
and spinal cord injury patients [7]. 
In social-cognitive theory [11] the construct of self-efficacy 
refers to the belief that a person is able to control chal- 
lenging environmental demands by taking adaptive action. 
Although generally thought of as being domain-specific, 
the concept of generalized self-efficacy, representing a 
broad and stable confidence in one's ability to deal with 
different demanding situations, has been suggested [12]. 
The General Self-Efficacy (GSE) Scale, originally developed 
by Jerusalem and Schwa rzer in 1979, aims to assess this 
general attribute. In the most recent version it consists 
of 10 items and has been adapted to several languages 
[13]. In different studies the scale showed internal con- 
sistencies between alpha = .75 and .94 [13], [14]. The 



retest-reliability was found to be between .47 and .75 for 
time periods rangingfrom 6 months to two years [13]. In 
a sample of high-school students positive correlations 
have been found with optimism and the perception of 
challenge in stressful situations, in a sample of teachers 
positive correlations with proactive coping, and self-regu- 
lation and negative correlations with procrastination and 
symptoms of burnout were found [13]. For clinical 
samples, negative correlations with depressive symptoms 
and anxiety as well as positive correlations with the use 
of adaptive, active coping strategies and quality of life 
measures have been reported [14], [15], [16]. 
Although the scale in its current form is already very effi- 
cient, in some cases an even shorter form of the ques- 
tionnaire may be desirable for inclusion into large mul- 
tivariate studies or for screening purposes [17]. The space 
and time savings gained by shortening the instrument 
are directly proportional to the reduction in length and 
have to be weighed up against the loss in reliability and 
validity to be expected. When constructing a short form 
of an assessment instrument and thus reducing the 
number of items, a central goal is to preserve the content 
coverage. Using the item-total correlation or the height 
of factor loadings as a criterion for the selection of the 
items tends to narrow the construct, thus compromising 
the validity, while keeping the reliability to the highest 
degree [17]. Instead of this, selecting items that capture 
the content of the construct to be assessed as broadly 
as possible and selecting items that are able to discrim- 
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inate among participants at different levels of the trait 
are alternative criteria deemed more useful [18]. 
The aim of the current study is to construct a short form 
of the General Self-Efficacy Scale, usingthe item selection 
criteria as discussed above, and subsequently to test the 
psychometric properties of the new instrument, to deter- 
mine its reliability and stability and to analyze its conver- 
gent, discriminant, and predictive validity. 

Methods 
Sample 1 

Sample 1 consists of n=19,719 participants from 26 
different countries, that responded to their respective 
language version oftheGSE scale, with participants with 
missing response on any of the GSE items (n=177 of the 
total n=19,896) excluded. To build the sample, samples 
from different countries and different studies were com- 
bined. The composition of the subsamples and the re- 
spective data collection procedures vary. The data 
(without the Swiss sample, n=776), available as an SPSS 
file for free download at the GSE website [19], have pre- 
viously been used to analyze the psychometric properties 
and cross-cultural differences of the general self-efficacy 
scale [13]. The sample consists of n=7,415 males and 
n=9,624 females (with n=2, 680 not statingtheir gender), 
ranging in age from 12 to 94 years with a mean age 
(standard deviation) of 25.1 (14.5) years (missing data 
for age for n=4,489). 

Instrument 

The 10 item version of the GSE scale [12] in different 
translations was used. According to the FAQ document 
available at the GSE website [19], the scale can be repro- 
duced and used without explicit permission in research 
studies, as longas the source of the scale is given appro- 
priately. The participants were asked to rate the degree 
to which each item applies to them on a scale ranging 
from "not at all true" (1) to "exactly true" (4). For the 
summary score the item scores are summed up. 

Analysis 

To select the items for the short version, the psychometric 
properties of the items were determined usingthe results 
of sample 1. As to the numbers of items to select, the 
Spearman-Brown formula [20] can be used to estimate 
the reliability to be expected with a reduced number of 
items in comparison with the full number of items. If reli- 
abilities of .90 respective .80 are assumed for the ten 
items of the original GSE scale a reduction to six items 
will reduce the reliability to .84 respective .71, which is 
deemed the maximum loss in reliability that is acceptable, 
thus the number of items to be selected is set at six. For 
both the 10 item and the 6 item version Cronbach's alpha 



was used as a measure for the internal consistency of 
the scale. 

To account for the hierarchical structure of the sample, 
variance components were estimated in a multilevel 
model with countries at level 1 and participants at level 
2, assuming random level effects and treating the items 
as metric variables. Using an item response theory ap- 
proach, item parameters were estimated using a two- 
parameter logistic model based on the graded response 
model [21]. In this approach, the ordered polytomous 
response options are mapped to a latent trait variable, 
resulting in a discrimination parameter (slope) and a dif- 
ficulty parameter for each category bound (thresholds). 
The higher the discrimination parameter, the closer the 
relationship of the item to the construct. The difficulty 
parameters indicate the values on the latent trait, at 
which there is a 50% probability to choose the respective 
higher response category. Descriptive statistics for sample 
1 were calculated using IBM SPSS Statistics version 20. 
For the estimation of the multilievel model and the item 
response theory model we used Mplus version 6.1 [22]. 

Sample 2 

Sample 2 consists of data from n=l,460 patients with 
risk factors for heart failure (diabetes, hypertension, sleep 
apnea, or coronary heart disease), including 207 patients 
with previously diagnosed heart failure, who completed 
the GSE scale as part of the larger longitudinal DIAST-CHF 
study, which has been described previously [23], [24]. 
Briefly, in 2004 and 2005 primary care physicians re- 
ferred outpatients aged 50 to 85 with the above-men- 
tioned risk factors for comprehensive cardiological and 
psychometric evaluation at one of the participating cen- 
ters. There were no exclusion criteria, except for unwill- 
ingness or inability (e.g., for language reasons) to partici- 
pate. The study was approved by the responsible ethics 
committees and all participants provided their written 
informed consent. The sample was almost balanced by 
sex with 51.5% men and 48.5% women. The mean age 
was 66.7 years (s=8.0 years; range: 38 to 87 years) (n=5 
missing data for gender and age). Several self-report 
questionnaires were administered at baseline and at two 
follow-ups, twelve (n=973) and 28 months (n=859) 
thereafter. For both follow-up assessments, patients were 
invited to come to their respective study centers for car- 
diological and psychometric assessments. 

Instruments 

The short form of the General Self Efficacy scale (GSE) 
that was constructed with the results of sample 1 was 
used with sample 2 to determine psychometric properties 
and to examine the construct and the prognostic validity 
in an independent clinical sample. The same response 
format as described above was used. 
The depression scale of the Patient Health Questionnaire 
(PHQ-9) [25], [26] consists of nine items, assessing dif- 
ferent depressive symptoms based on the diagnostic 
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criteria for major depressive disorder in the Diagnostic 
and Statistical Manual Fourth Edition (DSM-IV). On a 
4-point Likert-type scale ranging from "not at all" to 
"nearly every day", respondents are asked to rate the 
degree to which each symptom applied to them over the 
last two weeks. Items are scored from Oto 3 and summed 
up to build a summary score (range 0 to 27) with higher 
values signifying more severe depressive symptoms. 
The SF-36 questionnaire [27], [28] assesses functional 
health and well-being with 36 items on eight scales, from 
which physical and mental health summary scores are 
derived. The summary scores are transformed to T-scores 
with mean = 50 and standard deviation = 10, with higher 
values implying better health. 

The Hospital Anxiety and Depression Scale (HADS) [29], 
[30] consists of 14 items, with 7 items assessing the in- 
tensity of depressive and anxiety symptoms, respectively. 
Items are scored on a 4-point scale ranging from 0 to 3 
and summed up to a depression and an anxiety scale 
(range: 0 to 21). 

The Maastricht Questionnaire (MQ) [31] consists of 
21 items that describe different aspects of vital exhaus- 
tion with a possible score range from 0 to 42. 
The ENRICHD Social Support Inventory (ESSI) [32], [33] 
is a 5-item questionnaire assessing social support. The 
items, scored 1 to 5, are summed up for a total score, 
with higher scores indicating greater social support (total 
score range: 5 to 25). 

The short form of the Freiburg Questionnaire for Coping 
with Illness (Freiburger Fragebogen zur Krankheitsver- 
arbeitung, FKV-LIS) [34] consists of 35 items that assess 
different coping styles. The items are rated on a 5-point 
Likert-type scale ranging from 1 ("not relevant at all") to 
5 ("very relevant"). Items are summed up to build five 
scales representing different coping dimensions: 1) de- 
pressive coping, 2) active problem-focused coping, 
3) distraction and self-encouragement, 4) religious faith 
and search for meaning (5 items each), as well as 
5) minimisation and wishful thinking (3 items). The pos- 
sible scale score range is 5 to 25 for the first four scales 
and 3 to 15 for the last scale. 

Analysis 

After imputation of missing baseline scores using multiple 
imputation with the fully conditional specification ap- 
proach with five imputation datasets, correlations 
between the GSE-6 score and the baseline scale scores 
of all instruments described above were calculated. In a 
second step, the SF-36 physical and mental health scores 
of the second and third measurement (after 12 and 
28 months) were predicted using a linear regression 
model with the baseline GSE-6 score and the respective 
baseline health score as predictors. Finally, the relative 
and absolute stability of the GSE-6 score over the three 
measurement points was determined using Pearson 
correlations and a repeated measures analysis of vari- 
ance. All statistical analyses for sample 2 were done using 
IBM SPSS Statistics version 20. 



Results 
Sample 1 

The items of the General Self-Efficacy Scale were found 
to be very similar in content, thus limiting the usefulness 
of the first item selection criterion (selecting items that 
capture the content of the construct to be assessed as 
broadly as possible). Instead it appears more reasonable 
to select those items that show the highest variability in 
item scores, thus allowingfor good discrimination among 
participants at different levels of the trait. 
Table 1 shows the means, variances within and between 
clusters, the intraclass correlation coefficients, the coef- 
ficients of variation, and the corrected item-total correla- 
tion for the items as well as the discrimination and diffi- 
culty parameters estimated from the item response theory 
model. The intraclass correlation coefficients ranged 
between .09 and .26, indicating that between 9% and 
26% of the total item variance exists between the coun- 
tries. Item-total correlation coefficients were between .44 
and .60. Item discrimination parameters were between 
1.11 and 2.16, corresponding to standardized factor 
loadings between .52 and .77. The item difficulty para- 
meters ranged between -3.50 and +1.05, indicatingthat 
the items are better at differentiating at lower levels of 
the latent trait. 

The six items selected for the short form were those with 
the highest coefficients of variation. Among the selected 
items were four items with high discrimination parameters 
(items 5, 6, 7, and 10) and two items that are able to 
discriminate at the lowerand especially at the upper end 
of the scale due to low or very high difficulty parameters 
(items 2 and 3). 

Cronbach's alpha for the total sample was .85 for the 
10 item scale and .79 for the 6 item scale. While for the 
10 item scale the alphas ranged from .75 (India) to .91 
(Japan), for the 6 item scale they ranged between .64 
(Syria) and .85 (Japan). The correlation between the 10 
item sum score and the 6 item sum score for the total 
sample was .96, ranging between .93 (Portugal) and .97 
(Japan) for the different subsamples. 

Sample 2 

In sample 2, Cronbach's alpha for the GSE-6 scale was 
.86, .88, and .88 for the first, second, and third measure- 
ment, respectively. Associations of the GSE-6 scores with 
age were small (r=.03, r=.07, and r=.05, respectively) 
with slightly higher scores for males (Cohen's d=0.05, 
cNO.ll, and d=0.05, respectively). The correlations 
between the baseline GSE-6 score and the baseline 
scores of the other self-report instruments are shown in 
Table 2. As expected, the GSE-6 shows negative associ- 
ations with symptoms of depression and anxiety as well 
as with vital exhaustion, and positive associations with 
social support, mental health, and, to a lesser degree, 
with physical health. With regard to the coping dimen- 
sions, the GSE-6 score is positively associated with active 
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Table 1: Item characteristics and estimated item parameters for the items of the General Self-Efficacy Scale in sample 1 

(n=19,719) 

Item Item M s w 2 Sb 2 ICC cv nt a bi b2 b3 
No. 

1 I can always manage to solve 3.14 0.52 0.05 0.09 0.23 0.46 1.22 -3.50 -1.67 0.73 
difficult problems if I try hard 
enough. 

*2 If someone opposes me, I can 2.96 0.54 0.19 0.26 0.25 0.44 1.11 -3.14 -1.16 1.05 
find means and ways to get what 
I want. 

*3 It is easy for me to stick to my 2.80 0.61 0.16 0.21 0.28 0.53 1.44 -2.15 -0.75 1.25 
aims and accomplish my goals. 

*4 I am confident that I could deal 2.85 0.58 0.12 0.18 0.27 0.61 1.95 -2.09 -0.71 1.05 
efficiently with unexpected 
events. 

*5 Thanks to my resourcefulness, I 2.90 0.53 0.14 0.21 0.25 0.64 2.16 -2.11 -0.82 0.99 
know how to handle unforeseen 
situations. 

6 I can solve most problems if I 3.03 0.55 0.10 0.15 0.24 0.53 1.48 -2.71 -1.15 0.80 
invest the necessary effort. 

*7 I can remain calm when facing 2.95 0.61 0.10 0.14 0.26 0.60 1.85 -2.28 -0.88 0.87 
difficulties because I can rely on 
my coping abilities. 

8 When I am confronted with a 2.98 0.55 0.08 0.13 0.25 0.58 1.70 -2.59 -0.95 0.88 
problem, I can usually find 

several solutions. 

9 If I am in trouble, I can usually 3.01 0.50 0.07 0.12 0.23 0.60 1.89 -2.69 -1.07 0.92 
think of something to do. 

*10 No matter what comes my way, 2.97 0.60 0.12 0.17 0.26 0.60 1.83 -2.33 -0.89 0.83 
I'm usually able to handle it. 

M - mean; s w 2 - variance within; sb 2 - variance between; ICC - intraclass correlation coefficient; cv - coefficient of variation 
within; r t - corrected item-total correlation; a - discrimination parameter; b - difficulty parameters . Variance components and 

ICC were estimated in a multilevel model, discrimination and difficulty parameters in a two parameter logistic item response 
theory model. Selected items are marked with an *. 



Table 2: Correlations between GSE-6 scores and other self-report scale scores in sample 2 (n=l,460) 



Scale r 

depression (PHQ-9) -.35 
physical health (SF-36) .11 
mental health (SF-36) .36 

vital exhaustion (MQ) -.38 

depression (HADS) -.45 

anxiety (HADS) -.35 
social support (ESSI) .30 

depressive coping (FKV) -.34 
active problem-focused coping (FKV) .26 
distraction and self-encouragement (FKV) .25 
religious faith and search for meaning (FKV) .04 

minimization and wishful thinking (FKV) -.19 



PHQ-9 - Patient Health Questionnaire; SF-36 - 36-ltem 
Short-Form Health Survey; HADS - Hospital Anxiety and 
Depression Scale; MQ - Maastricht Questionnaire; 
ESSI - ENRICHD Social Support Inventory; 
FKV - Freiburger Fragebogen zur Krankheitsverarbeitung; 
all p<.01 except for "religious faith and search for meaning" 
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problem-focused coping and distraction/self-encourage- 
ment and negatively associated with depressive coping 
and minimization/wishful thinking. 
After controlling for the respective SF-36 baseline score, 
baseline general self-efficacy predicted the mental health 
summary score and the physical health summary score 
after 28 months. The regression weight for the mental 
health summary score was o=.49, which is significant at 
p<.001 (f=4.76; n=733) with a 95% confidence interval 
reaching from .29 to .70. The regression weight for the 
physical health summary score was o=.22, which is signi- 
ficant atp=.029 (t=2.18; n=733) with a 95% confidence 
interval reaching from .02 to .42. The prediction of the 
health summary scores after 12 months didn't reach 
significance. The respective regression weights were 
b=.21 (f=1.81; n=618; p=.07; 95% confidence interval 
-.02 to .44) for mental health and b=.18 [f=1.62; n=618; 
p=.ll; 95% confidence interval -.04 to .41) for physical 
health. 

The relative stability of the self-efficacy score, calculated 
as a Pearson correlation between the baseline score and 
the respective follow-up score, over twelve months 
(n=708) and 28 months (n=833) was r=.50 and r=.60, 
respectively. The mean self-efficacy score did not change 
over time (F=0.03; p=.97; n=558). 

Discussion 

We constructed a six item short form of the General Self- 
Efficacy scale that showed acceptable psychometric 
properties in different cultures and in non-clinical and 
clinical samples and good concurrent and predictive 
validity in a clinical sample of cardiac patients. The intern- 
al consistency for the 6 item scale (range alpha=.79 to 
.88) was only slightly smaller than the value generally 
observed for the original scale (range alpha=.75 to .94). 
The retest-reliability (.50 to .60) was in the range found 
for the original scale (.47 to .75). The strength of the as- 
sociations with measures of well-being (range .11 to .45) 
and with coping-related measures (range .04 to .34) are 
comparable to the effect sizes of r=.28 and r=.28 found 
in a meta-analysis of six different clinical and non-clinical 
samples [14]. 

In comparison with the original ten item scale, the short 
form can lead to time and resource savings of 40%. On 
the other hand a reduction in length generally leads to a 
loss of reliability and validity. These losses are to be 
traded against the benefits in the context of the intended 
use of the instrument [17]. Especially in large multivariate 
studies or for screening purposes the short form may be 
useful. 

There are some limitations to our study. Ideally, the 
overlap of the short and full form should have been shown 
using independent administrations, because scoringthe 
long and short form from one administration leads to 
overestimation of the correlation [17]. In future research, 
the reliability and validity of the GSE-6 should be tested 



in other languages and countries as well as using other 
clinical and non-clinical samples. 

Conclusions 

The six item short form of the GSE scale is a reliable and 
valid instrument that may be useful for the economical 
assessment of general self-efficacy in large multivariate 
studies and for screening purposes. 
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